Identification of Patients with Family History of Pancreatic Cancer - Investigation of an NLP System Portability
نویسندگان
چکیده
In this study we have developed a rule-based natural language processing (NLP) system to identify patients with family history of pancreatic cancer. The algorithm was developed in a Unstructured Information Management Architecture (UIMA) framework and consisted of section segmentation, relation discovery, and negation detection. The system was evaluated on data from two institutions. The family history identification precision was consistent across the institutions shifting from 88.9% on Indiana University (IU) dataset to 87.8% on Mayo Clinic dataset. Customizing the algorithm on the the Mayo Clinic data, increased its precision to 88.1%. The family member relation discovery achieved precision, recall, and F-measure of 75.3%, 91.6% and 82.6% respectively. Negation detection resulted in precision of 99.1%. The results show that rule-based NLP approaches for specific information extraction tasks are portable across institutions; however customization of the algorithm on the new dataset improves its performance.
منابع مشابه
Thoracoscopic Splanchnicectomy for Pain Control in Irresectable Pancreatic Cancer
Introduction : Severepain is a major problem in patients with unresectable pancreatic cancer. The goal of this study is to evaluate the effects of Thoracoscopic Splanchnicectomy (TS) on pain control in these patients suffering from unresectable pancreatic cancer. Methods:Between years 2000 to 2011, 20 patients suffering from unresectable pancreatic cancer underwent TS due to severe pain. They w...
متن کاملHealth education models application by peer group for improving breast cancer screening among Iranian women with a family history of breast cancer: A randomized control trial
Background: Studies have shown that participation of Iranian women with family history of breast cancer in screening service is low. This investigation has evaluated the effectiveness of health models according to peer group in improving clinical breast exam (CBE) among Iranian women with a family history of breast cancer. Methods: This was a randomized control ...
متن کاملThe impact of BMI, Smoking, Family History and Ala 119 Ser (rs1056827) Polymorphism of CYP1B1*2 Genes with Susceptibility to Prostate Cancer among Iranian Men
Background and Aims: The genes involved in detoxification and the elimination of toxic metabolites have a vital role in cancer pathogenesis. Also, there is evidence that higher amounts of body fat are associated with increased risks of several cancers. The current study aims to identify the relationship of age, body mass index (BMI), smoking, family history, and polymorphism rs1056827 of CYP1B1...
متن کاملComparing methods for identifying pancreatic cancer patients using electronic data sources.
We sought to determine the accuracy of two electronic methods of identifying pancreatic cancer in a cohort of pancreatic cyst patients, and to examine the reasons for identification failure. We used the International Classification of Diseases, 9(th) Edition (ICD-9) codes and natural language processing (NLP) technology to identify pancreatic cancer in these patients. We compared both methods t...
متن کاملInvestigation of HER-2 expression and its Correlation with clinicopathological parameters and overall survival of esophageal squamous cell carcinoma patients
Background & Objective: Human epidermal growth factor receptor 2 (HER-2) exhibits a vast range of expression in esophageal squamous...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Studies in health technology and informatics
دوره 216 شماره
صفحات -
تاریخ انتشار 2015